Evolution of the Twist Subfamily Vertebrate Proteins: Discovery of a Signature Motif and Origin of the Twist1 Glycine-Rich Motifs in the Amino-Terminus Disordered Domain
نویسندگان
چکیده
Twist proteins belong to the basic helix-loop-helix (bHLH) family of multifunctional transcriptional factors. These factors are known to use domains other than the common bHLH in protein-protein interactions. There has been much work characterizing the bHLH domain and the C-terminus in protein-protein interactions but despite a few attempts more focus is needed at the N-terminus. Since the region of highest diversity in Twist proteins is the N-terminus, we analyzed the conservation of this region in different vertebrate Twist proteins and study the sequence differences between Twist1 and Twist2 with emphasis on the glycine-rich regions found in Twist1. We found a highly conserved sequence motif in all Twist1 (SSSPVSPADDSLSNSEEE) and Twist2 (SSSPVSPVDSLGTSEEE) mammalian species with unknown function. Through sequence comparison we demonstrate that the Twist protein family ancestor was "Twist2-like" and the two glycine-rich regions found in Twist1 sequences were acquired late in evolution, apparently not at the same time. The second glycine-rich region started developing first in the fish vertebrate group, while the first glycine region arose afterwards within the reptiles. Disordered domain and secondary structure predictions showed that the amino acid sequence and disorder feature found at the N-terminus is highly evolutionary conserved and could be a functional site that interacts with other proteins. Detailed examination of the glycine-rich regions in the N-terminus of Twist1 demonstrate that the first region is completely aliphatic while the second region contains some polar residues that could be subject to post-translational modification. Phylogenetic and sequence space analysis showed that the Twist1 subfamily is the result of a gene duplication during Twist2 vertebrate fish evolution, and has undergone more evolutionary drift than Twist2. We identified a new signature motif that is characteristic of each Twist paralog and identified important residues within this motif that can be used to distinguish between these two paralogs, which will help reduce Twist1 and Twist2 sequence annotation errors in public databases.
منابع مشابه
Correction: Evolution of the Twist Subfamily Vertebrate Proteins: Discovery of a Signature Motif and Origin of the Twist1 Glycine-Rich Motifs in the Amino-Terminus Disordered Domain
[This corrects the article DOI: 10.1371/journal.pone.0161029.].
متن کاملIn silico structural analysis of quorum sensing genes in Vibrio fischeri
Quorum sensing controls the luminescence of Vibrio fischeri through the transcriptional activator LuxR and the specific autoinducer signal produced by luxI. Amino acid sequences of these two genes were analyzed using bioinformatics tools. LuxI consists of 193 amino acids and appears to contain five α-helices and six ß-sheets when analyzed by SSpro8. LuxI belongs to the autoinducer synthetase fa...
متن کاملCaspase Cleavage Motifs of Influenza Subtypes Proteins: Alternations May Switch Viral Pathogenicity
Background and Aims: The caspases are unique proteases that mediate the host cell apoptosis during viral infection. In this study, we identified the caspase cleavage motifs of H5N1 and H9N2 influenza viruses isolated during 1998-2012. Materials and Methods: Amino acid sequences of the eleven proteins encoded by the viruses as the caspase substrates downloaded from NCBI. The caspase cleavage mot...
متن کاملP-84: Characterization of Androgen Receptor Structure and Nucleocytoplasmic Shuttling of the Rice Field Eel
Background: Androgen receptor (AR) plays a critical role in prostate cancer and male sexual differentiation.Mechanisms by which AR acts and regulations of AR nucleocytoplasmic shuttling are not understood well. Materials and Methods: Degenerate PCR and RACE Cloning of AR Gene; Phylogenetic Analysis and Molecular Modeling;Real-time Fluorescent Quantitative RT-PCR; Northern Blot Hybridization;In ...
متن کاملDesigning Of Degenerate Primers-Based Polymerase Chain Reaction (PCR) For Amplification Of WD40 Repeat-Containing Proteins Using Local Allignment Search Method
Degenerate primers-based polymerase chain reaction (PCR) are commonly used for isolation of unidentified gene sequences in related organisms. For designing the degenerate primers, we propose the use of local alignment search method for searching the conserved regions long enough to design an acceptable primer pair. To test this method, a WD40 repeat-containing domain protein from Beauveria bass...
متن کامل